feat: Add embedding model integration tests with CI workflow #729
@OneZero-Y PTAL, thanks
Signed-off-by: Yehudit Kerido <[email protected]>
Optional: HF_TOKEN Configuration
This PR's tests support graceful fallback and pass without HF_TOKEN (Qwen3-only mode). To enable full Gemma model testing, maintainers can add
This enables the
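As an illustration of the graceful fallback described in this comment, here is a minimal Python sketch of how a test run could decide which embedding models to cover depending on whether HF_TOKEN is set. It is not the PR's actual test code: the helper name `models_under_test` and the labels `"qwen3"` and `"gemma"` are hypothetical, chosen only to mirror the Qwen3-only and full Gemma modes mentioned above.

```python
import os
from typing import List, Mapping, Optional


def models_under_test(env: Optional[Mapping[str, str]] = None) -> List[str]:
    """Return the model labels a test run should cover.

    Hypothetical helper: the names here are assumptions for illustration,
    not identifiers from the PR.
    """
    env = os.environ if env is None else env
    # Qwen3 is always tested; per the comment it needs no token.
    models = ["qwen3"]
    # Gemma is added only when a non-empty HF_TOKEN is present.
    if env.get("HF_TOKEN", "").strip():
        models.append("gemma")
    return models


if __name__ == "__main__":
    print(models_under_test({}))                        # no token set
    print(models_under_test({"HF_TOKEN": "hf_example"}))  # token set
```

Passing the environment as a parameter keeps the decision easy to test without touching the real process environment.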
Xunzhuo left a comment:
It looks like there is a gap between these tests and the target goal. Initially, we wanted to add test cases under the integration tests to verify the intelligentroute with signal embeddings:
https://vllm-semantic-router.com/docs/api/crd-reference#embeddingsignal
To make sure signal embeddings work, these should be added to https://github.com/vllm-project/semantic-router/tree/main/e2e/testcases
Thanks for clarifying! Let me explain how I interpreted #715, which should account for the gap.
Why I Built Infrastructure Tests
The issue explicitly states:
Under "General Embedding Signal Tests" → "Integration Tests", it lists:
I delivered exactly these tests because they were the most explicit and actionable requirements.
Where I Got Confused
The "Acceptance Criteria" mention PII detection/domain classification but don't specify:
I interpreted "separate from E2E tests" as "don't add to
Questions
I'm happy to add the EmbeddingSignal E2E test cases (some already exist, but more are needed). I just need to know whether the current work should be kept or discarded. Looking forward to your guidance!
@yehudit1987 instead of creating a new workflow for these tests, can you use the existing test and build workflow? The tests can also be added to the candle-binding directory instead of semantic-router, since these tests don't exercise the router components. Thanks

Add comprehensive test suite for embedding model infrastructure covering
concurrency, memory, and performance. Includes 29 integration tests,
GitHub Actions workflow, and complete documentation.
Resolve issue: #715